Skip to content

Add requesting-human-help skill for structured human-in-the-loop collaboration#673

Open
pratyush618 wants to merge 2 commits intoobra:mainfrom
pratyush618:feat/requesting-human-help-skill
Open

Add requesting-human-help skill for structured human-in-the-loop collaboration#673
pratyush618 wants to merge 2 commits intoobra:mainfrom
pratyush618:feat/requesting-human-help-skill

Conversation

@pratyush618
Copy link

Summary

  • Implements the skill requested in issue Skill for requesting help from human partners. #594
  • Adds skills/requesting-human-help/SKILL.md with a structured human-in-the-loop pattern
  • Covers two trigger categories: capability/access boundaries and high-risk irreversible actions
  • Provides a standardized request format (goal, context, prerequisites, steps, expected evidence, acceptance criteria)
  • Includes response validation algorithm, audit chain, red flags, and common mistakes

Closes #594

…aboration

Implements skill from issue obra#594: structured, evidence-driven requests
for capability limits and high-risk actions, with validation chain and
audit trail from request through human action to agent decision.
@coderabbitai
Copy link

coderabbitai bot commented Mar 10, 2026

📝 Walkthrough

Walkthrough

Adds a new skill documentation file that defines a standardized, evidence-driven workflow and template for agent requests for human assistance, covering required fields, validation rules, audit-trail chaining, red flags, usage scenarios, and a common mistakes table with fixes.

Changes

Cohort / File(s) Summary
Human Help Skill
skills/requesting-human-help/SKILL.md
New comprehensive documentation introducing a structured request format (Goal, Involvement level, Why I can't do this, Context, Prerequisites, Steps, Expected output/evidence, Acceptance criteria, If something goes wrong), usage scenarios and do-not-use cases, human-response validation requirements, explicit audit trail model (REQUEST → HUMAN ACTION → EVIDENCE → AGENT DECISION), red-flag halt conditions, and a Common Mistakes table with concrete fixes.

Sequence Diagram(s)

sequenceDiagram
  participant Agent as Agent
  participant Human as Human
  participant EvidenceStore as EvidenceStore
  participant Auditor as Auditor

  Agent->>Human: Submit structured REQUEST (goal, steps, prerequisites, acceptance criteria)
  Human->>EvidenceStore: Perform action & upload EVIDENCE (screenshots, logs, outputs)
  Human-->>Agent: Respond with EVIDENCE + human notes
  Agent->>EvidenceStore: Validate EVIDENCE against acceptance criteria
  alt Evidence meets criteria
    Agent->>Auditor: Log chain: REQUEST → HUMAN ACTION → EVIDENCE → AGENT DECISION
    Agent-->>Human: Confirm completion / continue workflow
  else Evidence missing or fails
    Agent-->>Human: Minimal follow-up request for missing pieces or escalate
  end
Loading

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

Poem

🐇 I asked with checklist, clear and bright,
A human helper came into sight.
With screenshots, logs, and proof in hand,
We finish tasks just as we planned.
Hops of progress — neat and right!

🚥 Pre-merge checks | ✅ 5
✅ Passed checks (5 passed)
Check name Status Explanation
Title check ✅ Passed The title accurately and concisely describes the main change: adding a skill for structured human-in-the-loop collaboration for requesting help.
Description check ✅ Passed The description is directly related to the changeset, explaining the implementation of the requesting-human-help skill and its key features.
Linked Issues check ✅ Passed The PR implements all key requirements from issue #594: standardized request format, validation algorithm, evidence requirements, audit chain, red flags, and common mistakes table.
Out of Scope Changes check ✅ Passed All changes are directly related to the requesting-human-help skill implementation requested in issue #594; no unrelated modifications detected.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@skills/requesting-human-help/SKILL.md`:
- Around line 38-66: Add a language tag (use "text") to the triple-backticked
fenced code blocks in the "Human Help Needed" section of SKILL.md to satisfy
markdownlint MD040; locate the fenced blocks that start/stop around the "##
Human Help Needed" content (and the additional blocks noted at ranges
corresponding to the acceptance criteria and REQUEST examples) and change the
opening "```" to "```text" for each block so all fenced sections are annotated
consistently.
- Around line 34-66: Add an explicit "Human involvement level" field to the
request template so each help request records whether the human is being asked
for "Clarification", "Execution", or "Approval/Takeover": update the structured
block under "## Human Help Needed" to include a line like "**Human involvement
level:** [Clarification | Execution | Approval/Takeover]" and, where relevant,
ensure "Steps", "Expected output / evidence needed", and "Acceptance criteria"
reference or differentiate behavior based on that level (e.g., approval requires
explicit confirm/accept output while clarification may require only answers), so
reviewers can unambiguously determine the authority granted by the human
response.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: 38f9aabf-63f4-498d-93b7-bac4b6b92ec0

📥 Commits

Reviewing files that changed from the base of the PR and between 33e55e6 and 00a4b36.

📒 Files selected for processing (1)
  • skills/requesting-human-help/SKILL.md

…uage tags

- Add explicit involvement level (clarification/execution/approval) to
  the request template so the audit trail records what authority the
  human response grants
- Add 'text' language tag to all fenced blocks to satisfy MD040
Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

♻️ Duplicate comments (1)
skills/requesting-human-help/SKILL.md (1)

62-103: ⚠️ Potential issue | 🟠 Major

Make evidence and validation branch on the involvement level.

approval/takeover is now named, but the template still treats its proof the same way as execution. For approval flows, the key artifact is the explicit authorization itself, including scope/constraints; otherwise REQUEST → HUMAN ACTION → EVIDENCE → AGENT DECISION still doesn't prove what authority was granted.

Suggested doc update
 **Expected output / evidence needed:**
-- [What to capture: screenshot, log output, command result, confirmation text]
-- [Format: paste text output, attach screenshot, confirm yes/no]
+- clarification: [Direct answer, plus cited source/screenshot if needed]
+- execution: [Artifact proving the step ran: screenshot, log output, command result]
+- approval/takeover: [Exact approval/takeover statement, scope, constraints, and who owns the action]

 **Acceptance criteria:**
-- [ ] [Specific, verifiable condition that means "this worked"]
-- [ ] [What distinguishes success from partial success]
+- [ ] [Criteria appropriate to the involvement level]
+- [ ] [For approval/takeover: authorization is explicit and scoped]

 IF all criteria met:
   → State: "Confirmed: [criterion 1], [criterion 2]. Proceeding."
+  → For approval/takeover, restate exactly what was approved and any limits before continuing.

 REQUEST → [structured block above]
 HUMAN ACTION → [what they did]
 EVIDENCE → [artifact they returned]
-AGENT DECISION → [what you decided based on evidence]
+AGENT DECISION → [what you decided based on evidence, including approval scope/limits]
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@skills/requesting-human-help/SKILL.md` around lines 62 - 103, Summary: The
validation template treats approval/takeover like execution and doesn't require
explicit authorization artifacts; update the "Validating the Human Response"
logic to branch on involvement level and require scope/constraints evidence for
approval flows. Fix: In the "Validating the Human Response" section, add
branching that checks the involvement type (e.g., "approval/takeover" vs
"execution") and for approval flows require an explicit authorization artifact
(authorization text, scope/constraints, signer identity) as evidence; for
execution flows keep the current evidence rules (logs/screenshots/outputs). Also
update the guidance text and the REQUEST→HUMAN ACTION→EVIDENCE→AGENT DECISION
audit chain to note that for approval flows the EVIDENCE must include the
authorization scope/constraints and signer, and adjust instructions to "If any
criterion unmet → Request ONLY the missing piece" to reflect the
involvement-specific missing artifact handling.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Duplicate comments:
In `@skills/requesting-human-help/SKILL.md`:
- Around line 62-103: Summary: The validation template treats approval/takeover
like execution and doesn't require explicit authorization artifacts; update the
"Validating the Human Response" logic to branch on involvement level and require
scope/constraints evidence for approval flows. Fix: In the "Validating the Human
Response" section, add branching that checks the involvement type (e.g.,
"approval/takeover" vs "execution") and for approval flows require an explicit
authorization artifact (authorization text, scope/constraints, signer identity)
as evidence; for execution flows keep the current evidence rules
(logs/screenshots/outputs). Also update the guidance text and the REQUEST→HUMAN
ACTION→EVIDENCE→AGENT DECISION audit chain to note that for approval flows the
EVIDENCE must include the authorization scope/constraints and signer, and adjust
instructions to "If any criterion unmet → Request ONLY the missing piece" to
reflect the involvement-specific missing artifact handling.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: 0f564151-447d-4966-abaa-a17244137054

📥 Commits

Reviewing files that changed from the base of the PR and between 00a4b36 and 57d42cc.

📒 Files selected for processing (1)
  • skills/requesting-human-help/SKILL.md

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Skill for requesting help from human partners.

1 participant